Numerical methods for optimal control of piecewise deterministic Markov processes

نویسنده

  • F. Dufour
چکیده

Scientific Research context: In 1980, M.H.A. Davis [1] introduced in probability theory Piecewise Deterministic Markov Processes (PDMP) as a general class of models suitable for formulating optimization problems in queuing and inventory systems, maintenance-replacement models, investment scheduling and many other areas of operation research. In the continuous-time context, stochastic control theory has from the numerical point of view, been mainly concerned with Stochastic Differential Equations (SDEs in short). From the practical and theoretical point of view, the numerical developments for this class of processes are extensive and largely complete. It capitalizes on the connection between SDEs and second order partial differential equations (PDEs in short) and the fact that the properties of the latter equations are very well understood. It is, however, hard to deny that the development of computational methods for the control of PDMPs has received little attention. One of the main reasons is that the role played by the familiar PDEs in the diffusion models is here played by certain systems of integro-differential equations for which there is not (and cannot be) a unified theory such as for PDEs as emphasized by M.H.A. Davis in his book. To the best of our knowledge, there is only one attempt to tackle this difficult problem by O.L.V. Costa and M.H.A. Davis. The originality of this project consists in studying this unexplored area.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Numerical method for optimal stopping of piecewise deterministic Markov processes

We propose a numerical method to approximate the value function for the optimal stopping problem of a piecewise deterministic Markov process (PDMP). Our approach is based on quantization of the post jump location – inter-arrival time Markov chain naturally embedded in the PDMP, and path-adapted time discretization grids. It allows us to derive bounds for the convergence rate of the algorithm an...

متن کامل

Uniform Assymptotics in the Average Continuous Control of Piecewise Deterministic Markov Processes : Vanishing Approach

This result has been generalized by Feller (cf. [12], XIII.5) to the case of uncontrolled deterministic dynamics in continuous time, [2] to deterministic controlled dynamics, etc. A further generalization (cf. [19]) allows the limit value function with respect to a system governed by controlled deterministic dynamics to depend on the initial data. In the Brownian diffusion setting, similar resu...

متن کامل

Optimal Control of Piecewise Deterministic Markov Processes with Finite Time Horizon

In this paper we study controlled Piecewise Deterministic Markov Processes with finite time horizon and unbounded rewards. Using an embedding procedure we reduce these problems to discrete-time Markov Decision Processes. Under some continuity and compactness conditions we establish the existence of an optimal policy and show that the value function is the unique solution of the Bellman equation...

متن کامل

Optimal control of infinite-dimensional piecewise deterministic Markov processes and application to the control of neuronal dynamics via Optogenetics

In this paper we define an infinite-dimensional controlled piecewise deterministic Markov process (PDMP) and we study an optimal control problem with finite time horizon and unbounded cost. This process is a coupling between a continuous time Markov Chain and a set of semilinear parabolic partial differential equations, both processes depending on the control. We apply dynamic programming to th...

متن کامل

A finite difference method for piecewise deterministic Markov processes

An extension of non-deterministic processes driven by the random telegraph signal is introduced in the framework of piecewise deterministic Markov processes [9], including a broader category of random systems. The corresponding Liouville-Master Equation is established and the upwind method is applied to numerical calculation of the distribution function. The convergence of the numerical solutio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011